Window - based Data Processing with Stratosphere
نویسنده
چکیده
Analyzing large amounts of ordered data is a common task in research and industry. The usual ordering domain is time: Examples for time-ordered data are sensor data, communication network data, or financial data. Besides online monitoring, it is common to investigate patterns or special events in the data after capturing it. These analysis can traditionally be performed within Data Stream Management Systems or Relational Database Management Systems. We decided to use the parallelization framework Stratosphere: By design, Stratosphere provides scalability by using clusters or clouds for computations. For ordered data analysis, sliding window semantics are necessary, which are not yet included within the operators of Stratosphere. In this work, we describe sliding window semantics from streaming databases and define Stratosphere operators with sliding window semantics. We introduce an exemplary implementation of one sliding window operator and evaluate its performance. The results show that Stratosphere with sliding window operators is a good choice for analysis on large amounts of ordered data. Augmented with the proposed sliding window operators, the applicability of Stratosphere gets broadened towards an even more general-purpose parallelization framework.
منابع مشابه
Adaptive Segmentation with Optimal Window Length Scheme using Fractal Dimension and Wavelet Transform
In many signal processing applications, such as EEG analysis, the non-stationary signal is often required to be segmented into small epochs. This is accomplished by drawing the boundaries of signal at time instances where its statistical characteristics, such as amplitude and/or frequency, change. In the proposed method, the original signal is initially decomposed into signals with different fr...
متن کاملApplying Stratosphere for Big Data Analytics
Analyzing big data sets as they occur in modern business and science applications requires query languages that allow for the specification of complex data processing tasks. Moreover, these ideally declarative query specifications have to be optimized, parallelized and scheduled for processing on massively parallel data processing platforms. This paper demonstrates the application of Stratosphe...
متن کاملImproved Snr Of Mst Radar Signals By Kaiser-Hamming And Cosh-Hamming Window Functions
In this paper the effect of window shape parameter „α‟ in Kaiser-Hamming and Cosh-Hamming window functions on the signal to noise ratio (SNR) values of the Indian Mesosphere-Stratosphere-Troposphere (MST) radar is computed. The six parts of multibeam observations of the lower atmosphere made by the MST radar are utilized for the analysis of results. Prior to the Fourier transformation, the in-p...
متن کاملA New Vision-Based and GPS-Signal-Independent Approach in Jamming Detection and UAV Absolute Positioning Assessment
The Unmanned Aerial Vehicles (UAV) positioning in the outdoor environment is usually done by the Global Positioning System (GPS). Due to the low power of the GPS signal at the earth surface, its performance disrupted in the contaminated environments with the jamming attacks. The UAV positioning and its accuracy using GPS will be degraded in the jamming attacks. A positioning error about tens of...
متن کاملبرآورد و ارزیابی ارزش در معرض ریسک بورس اوراق بهادار تهران بر مبنای روش شبیه سازی پنجره
Value at risk (VaR) is one of the most important risk measures for computing risk which is entered in financial framework in past two decades. In general there are three approaches including parametric, nonparametric and semi-parametric is used for estimating of VaR. this paper present a new method that is named window simulation which is classified in nonparametric approach. Processing of VaR ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2013